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Abstract 

A  large  deviation  principle  is  established  for  a  general  class  of  stochas¬ 
tic  flows  in  the  small  noise  limit.  This  result  is  then  applied  to  a  Bayesian 
formulation  of  an  image  matching  problem,  and  an  approximate  maximum 
likelihood  property  is  shown  for  the  solution  of  an  optimization  problem 
involving  the  large  deviations  rate  function. 
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1  Introduction 

Stochastic  flows  of  diffeomorphisms  have  been  a  subject  of  much  research  [4, 
15,  13,  6].  In  this  paper,  we  are  interested  in  an  important  subclass  of  such 
flows,  namely  the  Brownian  flows  of  diffeomorphisms  (cf.  [15]).  Our  goal  is  to 
study  small  noise  asymptotics,  specifically,  the  large  deviation  principle  (LDP) 
for  such  flows. 

Elementary  examples  of  Brownian  flows  are  those  constructed  by  solving 
finite  dimensional  Ito  stochastic  differential  equations.  More  precisely,  suppose 
b,  fi,  i  =  1  ,...,to  are  functions  from  x  [0,T]  to  that  are  continuous 
in  ( x,t )  and  ( k  +  l)-times  continuously  differentiable  (with  uniformly  bounded 
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derivatives)  in  x.  Let  Pi,  ■  ■  ■  pm  be  independent  standard  real  Brownian  motions 
on  some  filtered  probability  space  (fi,.F,P,  {.Ft}).  Then  for  each  s  £  [0,T]  and 
x  £  Rd,  there  is  a  unique  continuous  {Ft}— adapted,  Rd— valued  process  </>S:t(x), 
s  <t  <T,  satisfying 


ft  rn  ^ 

4>a,t{x)  =  x+  l  b(pS:r(x),r)dr +  /  fi(4>a,r{x),r)dpi(r).  (1.1) 

Js  j=1  Ja 

By  choosing  a  suitable  modification,  {^s,t,0  <  s  <  t  <  T}  defines  a  Brownian 
flow  of  Cfc—  diffeomorphisms  (see  Section  2).  In  particular,  denoting  by  Gk  the 
topological  group  of  Cfe-diffeomorphisms  (see  Section  3  for  precise  definitions 
of  the  topology  and  the  metric  on  Gk),  one  has  that  <f>  =  <  t  <  T}  is 

a  random  variable  with  values  in  the  Polish  space  Wk  =  C([0,T]  :  Gk ).  For 
£  £  (0,oo),  when  /{  is  replaced  by  eft  in  (1.1),  we  write  the  corresponding  flow 
as  cff .  Large  deviations  for  pe  in  Wk,  as  e  — >  0,  have  been  studied  for  the  case 
k  =  0  in  [18,  3]  and  for  general  k  in  [5]. 

As  is  well  known  (cf.  [16,  4,  15]),  not  all  Brownian  flows  can  be  expressed  as 
in  (1.1)  and  in  general  one  needs  infinitely  many  Brownian  motions  to  obtain 
a  stochastic  differential  equation  (SDE)  representation  for  the  flow.  Indeed 
typical  space-time  stochastic  models  with  a  realistic  correlation  structure  in  the 
spatial  parameter  naturally  lead  to  a  formulation  with  infinitely  many  Brownian 
motions.  One  such  example  is  given  in  Section  5.  Thus,  following  Kunita’s 
[15]  notation  for  stochastic  integration  with  respect  to  semi-martingales  with  a 
spatial  parameter,  the  study  of  general  Brownian  flows  of  Cfc-diffeomorphisms 
leads  to  SDEs  of  the  form 

d<j>Stt{x)  =  F{(f>st{x),dt),  <j>s>8{x)  =  x,  0  <  s  <t  <T,  x  £Rd,  (1.2) 

where  F(x,t)  is  a  Cfe+1-Brownian  motion  (See  Definition  2.2).  Note  that  such 
an  F  can  be  regarded  as  a  random  variable  with  values  in  the  Polish  space 
Wk  =  C([0,  T]  :  Cfc+1(Rd)),  where  Ck+1(Rd)  is  the  space  of  (k  +  1)  times 
continuously  differentiable  functions  from  Rd  to  Rd.  Representations  of  such 
Brownian  motions  in  terms  of  infinitely  many  independent  standard  real  Brow¬ 
nian  motions  is  well  known  (see,  e.g.,  Kunita  [15,  Exercise  3.2.10]).  Indeed,  one 
can  represent  F  as 

ft  °°  ft 

F[x,t)  =  /  b(x,r)dr  +  ^2  ft(x,  r)dPi(r),  (x,  t)  £  Rd  x  [0,  T],  (1.3) 

Jo  t=1  Jo 

where  {/3i}“1  is  an  infinite  sequence  of  i.i.d.real  Brownian  motions  and  b ,  /,;  are 
suitable  functions  from  Rd  x  [0,T]  to  (see  below  Definition  2.2  for  details). 

Letting  ci(x,  y,  t)  =  fi(x ;  t)/{(2/j  t)  for  x,y  £  Rd,  t  £  [0,  T\,  the  functions 

(a,  b)  are  referred  to  as  the  local  characteristics  of  the  Brownian  motion  F.  When 
equation  (1.2)  is  driven  by  the  Brownian  motion  F£  with  local  characteristics 
(ea,  6),  we  will  denote  the  corresponding  solution  by  (jf .  In  this  work  we  establish 
a  large  deviation  principle  for  ( <f>e,Fe )  in  Wk- i  x  Wk- 1-  Note  that  the  LDP  is 
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established  in  a  larger  space  than  the  one  in  which  (<pe,  Fe)  take  values  (namely, 
Wk  x  Wk).  This  is  consistent  with  results  in  [18,  3,  5],  which  consider  stochastic 
flows  driven  by  only  finitely  many  real  Brownian  motions.  The  main  technical 
difficulty  in  establishing  the  LDP  in  Wk  x  Wk  is  the  proof  of  a  result  analogous 
to  Proposition  4.10,  which  establishes  tightness  of  certain  controlled  processes, 
when  A;  —  1  is  replaced  by  k. 

As  noted  above,  the  stochastic  dynamical  systems  considered  in  this  work 
are  driven  by  an  infinite  dimensional  Brownian  motion.  A  broadly  applicable 
approach  to  the  study  of  large  deviations  for  such  systems,  based  on  variational 
representations  for  functionals  of  infinite  dimensional  Brownian  motions,  has 
been  developed  in  [7,  8] .  Several  authors  have  adopted  this  approach  to  analyze 
the  large  deviation  properties  of  a  variety  of  models,  including  stochastic  PDEs 
with  random  dynamic  boundary  conditions  [22],  stochastic  Navier-Stokes  equa¬ 
tions  [21],  and  infinite  dimensional  SDEs  with  non-Lipschitz  coefficients  [20,  19]. 
The  approach  is  a  particularly  attractive  alternative  to  standard  discretiza¬ 
tion/approximation  methods  (cf.  [2])  when  the  state  spaces  are  non-standard 
function  spaces,  such  as  the  space  of  diffeomorphisms  used  in  the  present  paper. 

The  proof  of  our  main  result  (Theorem  3.1)  proceeds  by  verification  of  a 
general  sufficient  condition  obtained  in  [8]  (see  Assumption  2  and  Theorem  6 
therein;  see  also  Theorem  3.5  of  the  current  paper).  The  verification  of  this 
condition  essentially  translates  into  establishing  weak  convergence  of  certain 
stochastic  flows  defined  via  controlled  analogues  of  the  original  model  (see  The¬ 
orem  3.4).  These  weak  convergence  proofs  proceed  by  first  establishing  con¬ 
vergence  for  N— point  motions  of  the  flow  and  then  using  Sobolev  and  Rellich- 
Kondrachov  embedding  theorems  (see  the  proof  of  Proposition  4.10)  to  argue 
tightness  and  convergence  as  flows.  The  key  point  here  is  that  the  estimates 
needed  in  the  proofs  are  precisely  those  that  have  been  developed  in  [15]  for  gen¬ 
eral  qualitative  analysis  (e.g.  existence,  uniqueness)  of  the  uncontrolled  versions 
of  the  flows.  Unlike  in  [18,  3]  and  [5]  (which  consider  only  finite  dimensional 
flows),  the  proof  of  the  LDP  does  not  require  any  exponential  probability  esti¬ 
mates  or  discretization/ approximation  of  the  original  model. 

In  Section  5  of  this  paper  we  study  an  application  of  these  results  to  a  prob¬ 
lem  in  image  analysis.  Stochastic  diffeomorphic  flows  have  been  suggested  for 
modeling  prior  statistical  distributions  on  the  space  of  possible  images/targets 
of  interest  in  the  study  of  nonlinear  inverse  problems  in  this  field  (see  [12]  and 
references  therein).  Along  with  a  data  model,  noise  corrupted  observations  with 
such  a  prior  distribution  can  then  be  used  to  compute  a  posterior  distribution 
on  this  space,  the  “mode”  of  which  yields  an  estimate  of  the  true  image  under¬ 
lying  the  observations.  Motivated  by  such  a  Bayesian  procedure  a  variational 
approach  to  this  image  matching  problem  has  been  suggested  and  analyzed  in 
[12].  A  goal  of  the  current  paper  is  to  develop  a  rigorous  asymptotic  theory  that 
relates  standard  stochastic  Bayesian  formulations  of  this  problem,  in  the  small 
noise  limit,  with  the  deterministic  variational  approach  taken  in  [12].  This  is 
done  in  Theorem  5.1  of  Section  5. 

Other  possible  applications  of  the  results  in  the  current  paper  are  as  follows. 
Large  deviations  for  stochastic  flows  were  studied  in  [18]  in  order  to  obtain  large 
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deviation  estimates  for  finite  dimensional  anticipative  SDEs.  The  results  of  the 
current  paper  are  the  first  step  towards  the  study  of  the  analogous  problem  for 
infinite  dimensional  SDEs.  The  paper  [5]  used  the  LDP  for  stochastic  diffeomor- 
phic  flows  to  study  large  deviation  properties,  as  e  — >  0,  of  finite  dimensional 
diffusions  generated  by  eL i  +  L2,  where  L 1,  L2  are  two  second  order  differential 
operators.  The  analogous  problem  for  infinite  dimensional  diffusions  is  currently 
open;  a  key  ingredient  is  again  the  LDP  for  infinite  dimensional  flows  obtained 
in  the  current  paper. 

We  now  give  an  outline  of  the  paper.  Section  2  contains  some  background 
definitions  of  Cfe— Brownian  motions  and  Brownian  flows.  Section  3  presents 
the  main  large  deviation  result  of  the  paper.  The  key  weak  convergence  needed 
to  prove  this  result,  Theorem  3.4,  is  given  in  Section  4.  Finally,  Section  5 
introduces  the  image  analysis  problem  and  uses  results  of  Section  3  to  obtain 
an  asymptotic  result  relating  the  Bayesian  formulation  of  the  problem  with  the 
deterministic  variational  approach  of  [12]. 

We  generally  follow  the  notation  of  [15].  A  list  of  standard  notational  con¬ 
ventions  is  given  at  the  end  of  the  paper,  but  specialized  notation  is  as  follows: 


•  Let  a  =  (ai,a2,  ■  ■  ■  ,(Xd)  be  a  multi  index  of  non-negative  integers  and 
| a]  =  ai  +  0-2  +  ■  ■  ■  +  cud-  For  an  | a |— times  differentiable  function  /  : 
Rd  ->  R,  set  daf  =  d “/  =  (dxi)J!!“'{dXd)ad  •  For  such  an  /,  we  write 

as  dif.  If  /  =  (/1,  f2,  ■  ■  . ,  fd)  is  an  | a |—  times  differentiable  function 
from  Rd  to  Rd,  we  write  daf  =  (dafi,dafi,...,dafd)  ■  By  convention 

d°f  =  /• 

•  For  to  >  0  denote  by  Cm  the  space  of  to— times  continuously  differentiable 
functions  from  Rd  to  R. 

•  For  any  subset  A  C  Rd,  m>  0,  and  /  £  Cm,  let 

ll/l|m;A=  51  SUp\daf(x)\. 
n^,  xEA 


The  space  Cm  is  a  Frechet  space  with  the  countable  collection  of  semi¬ 
norms  ||/||m;A„,Ai  =  {tf  :  M  <  n}.  In  particular,  it  is  a  Polish  space 
with  a  topology  that  corresponds  to  “uniform  convergence  on  compacts” . 

•  For  0  <  S  <  1,  let 


,,S:A  = 


\ot\=m 


sup 

x,ydzA]x^y 


\daf(x)  -daf(y)  | 

\x-y\5 


and 

Cm,5  ^  jj  e  £rn  .  \\f\\mj.An  <  oo  for  any  n  £  N}  . 
The  seminorms  {||  •  \\m,s-,An,n  £  N}  make  Cm,<5  a  Frechet  space. 
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•  For  m  >  0  denote  by  Cm  the  space  of  functions  g  :  Rd  x  Rd  — >  R.  such  that 
g(x,y ),  i,!/  £  ]Rd  is  to— times  continuously  differentiable  with  respect  to 
both  x  and  y.  Endowed  with  the  seminorms 


where  n  £  N,  Cm  is  a  Frechet  space.  Also,  for  0  <  5  <  1  let 


Then  __ 

Cm’5  =  1 5  G  Cm;  \\g\\Z,S;An  <  oo,  for  any  n  G  n| 

is  a  Frechet  space  with  respect  to  the  seminorms  {||  ■  ||~  s-A,^n  €  N}. 

•  We  write  ||/||m;R<*  as  ||/||m.  The  norms  ||  ■  ||m,a,  ||  •  ||~,  ||  •  ||~>5  are  to  be 
interpreted  in  a  similar  manner. 

.  Let  Cm(Rd)  =  {/  =  (A,  f2, . . . ,  fd)'  -Ji  €  Cm,  i  =  1, 2, . . . ,  d}  and \\f\\m  = 
Et=i  ||/*||m-  The  spaces  Cm’5(Rd),Cm(Rdxd),  Cm’S(Rdxd)  and  their  cor¬ 
responding  norms  are  defined  similarly.  In  particular,  note  that  h  £ 

Cm’S  (Rdxd)  ig  a  map  from  x  Rd  t0  Rdxd . 

•  Let  C™,(5(Mrf)  and  C™’<5(R‘ix<i)  be  the  classes  of  measurable  functions  b  : 
[0,  T]  -►  Cm’S(Rd)  and  a  :  [0,T]  ->  Cm’S(Rdxd)  respectively  such  that 

IN  km, <5=  sup  ||6(t)||m,5  <  oo  and  \\a\\y  s  =  sup  ||a(i)||~  s  <  oo. 

0  <t<T  0  <t<T 

•  We  denote  the  Hilbert  space 


where  the  inner  product  on  I2  is  defined  as  (x,  y)l2  =  E^i  xiVii  X,U  £  h- 
We  denote  the  corresponding  norm  as  ||  •  ||z2. 

•  Given  a  filtered  probability  space  we  define 


A[l-2\  [-predictable  for  all  % 
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•  Define 


SN[k]  =  L=  {MZi  G  L2([0,T]  :  l2)  s.t.  £  Ms)\\lds  <  -/Vj  . 

Endowed  with  the  weak  topology  for  the  Hilbert  space  L2([0,T]  :  l2), 
Sjvfo]  is  a  compact  Polish  space. 

•  Define 

AN[h]  =  {u  G  A[h]  ■  u(lu)  G  5jv,  P-a.s.} . 

We  will  always  consider  Sly  [I2]  with  the  weak  topology,  when  referring  to 
convergence  in  distribution  of  *Sjv [^2] — valued  random  variables. 


2  Preliminaries 

Let  o  denote  the  composition  of  maps  and  let  id  denote  the  identity  map  on  Rd. 

Definition  2.1  (Stochastic  flows  of  homeomorphisms/diffeomorphisms)  A  col¬ 
lection  {0Sit(x)  :0  <  s  <  t  <  T,x  £  Rd}  of  M.d  -valued  random  variables  on 
some  filtered  probability  space  P,  {Ft})  is  called  a  forward  stochastic  flow 

of  homeomorphisms,  if  there  exists  N  G  F,  with  P (N)  =  0,  such  that  for  any 
to  G  Nc: 

1.  (s,t,x)  1— >  (f>s  t(x,LO )  is  a  continuous  map, 

2.  4>s,u(F)  =  <ft,u(  uj)  o  <f>att(u>)  holds  for  all  s,t,u,  0  <s<t<u<T, 

3.  (f>Si3(uj)  =  id  for  all  s,  0  <  s  <  T, 

4-  the  map  cf>Stt(uj)  :  is  an  onto  homeomorphism  for  all  s,  t,  0  <  s  < 

t  <  T. 

If  in  addition  f>Stt(x,ui)  is  k-times  differentiable  with  respect  to  x  for  all 
s  <t  and  the  derivatives  are  continuous  in  (s,t,x),  it  is  called  a  stochastic  flow 
of  Ck—diffeomorphisms. 

We  now  introduce  a  Brownian  motion  with  a  spatial  parameter,  with  local 
characteristics  (a,b).  Throughout  this  paper  we  will  assume  that  (a,  b)  G 
c!fS  (R.dxd)  xC^^K^),  for  some  k  G  N  and  5  G  (0, 1].  Fix  v  such  that  0  <  v  <  5. 

Definition  2.2  (Cfe,y —Brownian  motion)  A  continuous  stochastic  process  {F(t)}t>o 
on  some  filtered  probability  space  (12,  F ,  P,  {Ft})  with  values  in  Ck,u(M.d)  is  said 
to  be  a  Ck’v— Brownian  motion  with  local  characteristics  (a,  b),  if  F( 0),  F(t,+ 1)  — 
F(ti),  i  =  0, 1,  ...,n  —  1,  are  independent  C k,^{Rd)— valued  random  variables 
whenever  0  <  to  <  <  ■  ■  ■  <  tn  <  T,  and  if  for  each  x  G  Kd,  M(x,t)  = 

F(x,t)  —  f*  b(x,r)dr  is  a  continuous  ( d-dimensional )  martingale  such  that 
({ M(x ,  • ),M(y ,  ■ )))t  =  f*  a(x,y,r)dr  for  all  (x,y)  G  Rd  x  Rd. 
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The  existence  of  a  Ck,u— Brownian  motion  with  local  characteristics  (a,  b)  follows 
from  [15]  (see,  e.g.,  Theorem  3.1.2  and  Exercise  3.2.10).  Indeed,  for  any  7  <6 
one  can  represent  F  as  in  (1.3),  where  /)  :  x  [0,T]  — >  M.d  are  such  that  for 

each  t  £  [0, T],  £  Cfe,7(Rd), 

OO 

a(x,y,t)  =  '^2fi{x,t)f-(y,i),  a.e.  t, 

i= 1 


and 


| fi(x,r)\2dr  <  T||a||riM 


<  00. 


In  particular,  note  that  if  F  is  a  Cfc,y— valued  Brownian  motion,  its  finite 
dimensional  restriction  (F(x  1 ,  -) ,  F(x2,  ■),■■■,  F(xn,  •))'  is  an  nd— dimensional 
Brownian  motion  (with  suitable  mean  and  covariance)  for  any  (x\, . . . ,  xn)  £ 
R"d.  If  F  is  as  defined  by  (1.3)  and  {</>t}o<t<i  is  a  continuous  Rd— valued 
{Ft}— adapted  stochastic  process,  the  stochastic  integral  /]*  F(cj>r,dr)  is  a  well- 
defined  d— dimensional  continuous  {Ft}— adapted  stochastic  process  (see  Chap¬ 
ter  3,  Section  2,  pp.  71-86  of  [15]). 


Definition  2.3  Let  F  be  as  in  Definition  2.2.  Then  for  each  s  £  [0,T]  and  x  £ 
Rd,  there  is  a  unique  continuous  Ft  — adapted,  Rd— valued  process  4>s,t{x),  s  < 
t  <  T  satisfying  (f>S}t{ x)  =  x  +  f*  F((j>s>r(x),  dr),  t  £  [s,T].  This  stochastic 
process  is  called  the  solution  of  ltd’s  stochastic  differential  equation  based  on  the 
Brownian  motion  F. 


From  [15,  Theorem  4.6.5]  it  follows  that  {< fis.t}o<s<t<T  as  introduced  in  Defini¬ 
tion  2.3  has  a  modification  that  is  a  forward  stochastic  flow  of  Cfc-difleomorphisms. 


3  Large  deviation  principle 

Given  e  >  0,  let  Fe  be  a  Ck,v— Brownian  motion  on  some  filtered  probability 
space  (fi,  F,f,  {Ft}),  with  local  characteristics  (ea,b),  where  (k,v)  and  (a,  b) 
are  as  in  Section  2.  Without  loss  of  generality  we  assume  that  Fe  is  represented 
as 


Fe(x,  t)  =  [  b(x,  1 
Jo 


■)dr 


fi(x,r)d(3i(r),  (x,t)  £  Id  x  [0,T],  (3.1) 


/=!  ’ 


where  (/?z,/z)z>  1  are  as  in  Section  2.  Note  in  particular  that 


Fe(x,t) 


x,r)dr  =  y/eM(x,t). 


With  an  abuse  of  notation,  when  e  =  en  we  write  Fe  as  Fn.  Observe  that 
{{M(x,‘),Pt(-)))t  =  JoMx’r)dr  for  all  t  £  [0 ,T],  a.s.  Let  (ff  =  {^(a;),0  < 
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s  <  t  <  T,x  £  Rd}  be  the  forward  stochastic  flow  of  Cfc— diffeomorphisms  based 
on  Fe.  With  another  abuse  of  notation,  we  write  </>§  t  as  <j>f  and  <fiE  =  {(jft  (x),  0  < 
t<T,x£  Rd}. 

The  goal  of  this  paper  is  to  show  that  the  family  (cjf ,  Fe)e> o  satisfies  a  large 
deviation  principle  (LDP)  on  a  suitable  function  space,  as  e  — ■>  0.  For  m  £  N, 
let  Gm  be  the  group  of  Cm— diffeomorphisms  on  Rd.  Gm  is  endowed  with  the 
metric 


where 


Am(</>,  ip)  =  Y  P(dacl),da%jj), 

|a|<m 


(3.2) 

(3.3) 


,  ,  ,,  _  1  supp^jy  |<ft(a;)  -  if>(x)\ 

P  ^  1  +SUP|rr|<iV  -  V’(*)l  ' 

Under  this  metric  Gm  is  a  Polish  space.  Let  Wm  =  C([0,T]  :  Gm )  be  the  set 
of  all  continuous  maps  from  [0,T]  to  Gm  and  Wm  =  C([0,T]  :  Cm(Rd))  be 
the  set  of  all  continuous  maps  from  [0,  T]  to  Cm(Kd).  The  space  Wrn  endowed 
with  the  metric  dm(cj),ip)  =  sup 0<t<T  dm((f>(t),tp(t))  and  the  space  Wm  with 
the  metric  dm((j),ip)  =  sup0<t<T  Xm((j)(t),  are  Polish  spaces.  Note  that 
(cj)e,Fe)  belongs  to  Wk  x  W, k  C  W^-i  x  Wk- i  Q  Wk-i  x  VPfc-i-  We  will  show 
that  the  pair  (</>e,  F6)e>0  satisfies  LDPs  in  both  of  the  spaces  Wk-i  x  Wk- i  and 
Wfc_i  x  Wfc_i,  with  a  rate  function  I  that  is  introduced  below. 

Let  u  =  {ui}^x  £  Uiv>i^[y.  Given  any  such  control,  we  want  to  con¬ 
struct  a  corresponding  controlled  flow  in  the  form  of  a  perturbed  analogue  of 
(3.1).  Observe  that  Zt  =  Jo  uiis)dPi(s)  is  a  continuous  square  integrable 
martingale.  For  any  7  <  6  one  can  find  bu  :  M.d  x  [0,T]  x  fl  such 

that  bu(t,u>)  £  Cfc,7(Kd)  for  a.e.  such  that  for  each  x  £  Rd,  bu(x,-) 


is  predictable,  and  such  that  fQ  bu(x,s)ds  =  ({Z ,  M (x ,  ■)}) t  for  each  (x,t)  £ 


M.d  x  [0,T].  In  particular,  for  each  x  £ 
Furthermore,  for  some  c  £  (0,  00), 


bu(x,t)  =  YmIi  ui(t)fi(x,t)  a.e. 


\\K(t)\\ln  <  c||o||r)M^|uj(t)|2,  [di®P] -a.e.  in  (t,u).  (3.4) 


1=1 


The  proofs  of  these  statements  follow  along  the  lines  of  Exercise  3.2.10  and 
Lemma  3.2.3  of  [15].  Next,  define 


F°’u(x,t)  =  f  '  bu(x,s)ds+  f  b(x,s)ds. 
Jo  Jo 


(3.5) 


It  follows  that  F°’u(-,t)  is  a  Cfe’7(Md)-valued  continuous  adapted  stochastic 
process.  Let  bu  =  bu  +  b  and  for  (t o,x)  £  [0,T]  x  Rd  let  {4>t^tix)} t0<t<T  be  the 


unique  solution  of  the  equation 


4>to'!t(x)  =  x+  [  (x),r)dr,  t  G  [t0,T].  (3.6) 

J  to 

From  [15,  Theorem  4.6.5]  it  follows  that  {0°’“,  0<s<t<T}isa  forward  flow 
of  Cfc— diffeomorphisms. 

For  (< i)°,  F°)  G  Wk  x  Wk  define 

I(<t>°,F°)  =  inf  |  /  |.Ks)||?2ds,  (3.7) 

ueC(<i>o,Fo)  2  J o 

where  C((j>°,F°)  =  {u  G  L2([0,T]  :  l2)\(<f>° ,  F°)  =  (<f>0’u,F°’u)}.  Note  in  partic¬ 
ular  that  u  in  (3.7)  is  deterministic.  If  (</>°,F°)  G  {Wk- i  x  Wk- i)  \  (Wk  x  Wk) 
then  we  set  I(<tP,F°)  =  oo.  We  denote  the  restriction  of  I  to  Wk- i  x  Wk- i  by 
the  same  symbol.  The  following  is  the  main  result  of  the  section. 


Theorem  3.1  ( Large  deviation  principle)  The  family  (<fie,Fe)e> o  satisfies  a 
LDP  in  the  spaces  Wk- i  x  Wk- 1  and  Wfc_i  x  Wk- 1  with  rate  function  I. 


Let  {un}ff=1  (un  =  (uPlgi)  be  a  sequence  in  for  some  fixed  N  < 

oo.  Let  {en}n>o  be  a  sequence  such  that  en  >  0  for  each  n  and  en  — ►  0 
as  n  — >  oo.  Note  that  we  allow  en  =  0  for  all  n.  Recall  that  M(x,t)  = 


E“i  fo  fi(x,r)dfli(r),  (XA)  G  Rd  x  [0,  T\.  Define 


Fn(x,  t)  =  /  bUrl(x,r)dr+  yfef  M(x,t), 

Jo 

and  let  (f>n  be  the  solution  to 

4>t(x)=x+  f  bUn(<f>r(x),r)dr  +  y/e^  f  M  (<j%  (x) ,  dr) . 


(3.8) 


(3.9) 


Clearly  Fn  G  Wk,  and  from  [15,  Theorem  4.6.5],  equation  (3.9)  has  a  unique 
solution  (f>n  G  Wk  a.s.  We  next  introduce  some  basic  weak  convergence  defini¬ 
tions. 


Definition  3.2  Let  u  G  AN[h]  and  {4>n}  be  as  above.  Let  Pp_1,P°_1  be  the 
measures  induced  by  (cj)n,  Fn),  (</>°’“,  F0,u)  respectively,  on  Wk- i  x  Wk-i-  Thus 
for  A  G  B(Wk-\  x  Wk-\), 

H-M)  =  P ((</>n,Tn)  G  A),  Pg_i(A)  =  P((0O,“, F0,u)  G  A). 

The  sequence  {(</>",  Fn)}r  is  said  to  converge  weakly  as  Gk  flows  to  (< f>0,u ,  F0,u) 
as  n  — >  oo  z/Pp_1  converges  weakly  to  P°_i  as  n  — >  oo. 
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Definition  3.3  Let  P^_1,P°_1  be  the  measures  induced  by  {(f>n ,  Fn),  F0,“) 
respectively  on  Wk-i  x  Wk- i-  T7ie  sequence  {(<f>n,  Fn)}n>1  sa id  to  converge 
weakly  as  Cfc_1  —  flows  to  {(j)0,u ,  F°’u)  as  n  —>  oo  if  P1^_1  converges  weakly  to 
P°_i  as  n  — >  oo. 

As  noted  in  the  introduction,  proofs  of  large  deviations  properties  based  on 
the  general  framework  developed  in  [8]  essentially  reduce  to  weak  convergence 
questions  for  controlled  analogues  of  the  original  process.  For  our  problem  the 
following  theorem  gives  the  needed  result.  The  proof  is  given  in  the  next  section. 

Theorem  3.4  Let  {un}  converge  to  u  in  distribution  as  an  -S'jv [^2]  —  valued  se¬ 
quence  of  random  variables.  Then  the  sequence  {(<fin ,  Fn)}n>  1  converges  weakly 
as  Cfe_1  —  flows  and  Gfc_1  —  flows  to  the  pair  (<f>°’u ,  F0,u)  as  n —>  00. 

The  following  theorem  is  taken  from  [8].  Let  R°°  denote  the  product  space  of 
countably  many  copies  of  the  real  line.  Then  S  =  G([0,T]  :  K°°)  (with  the  usual 
topology)  is  a  Polish  space  and  (3  =  is  a  S'— valued  random  variable. 

Theorem  3.5  Let  £  be  a  Polish  space,  let  {Ge}s>o  be  a  collection  of  measurable 
maps  from  ( S,B(S ))  to  (£,B(£)),  and  let  Xs  =  Ge(^/e(3).  Suppose  that  there 
exists  a  measurable  map  G°  :  S  — >  £  such  that  for  every  N  <  00  the  set  Tn  = 
{G°  (fQ  u(s)ds )  :  u  £  Sjv^]}  is  a  compact  subset  of  £.  For  f  £  £  let  Cf  =  {u  £ 
L2([0,T]  :  l2)  :  f  =  G°(f0  usds)j.  Then  I  defined  by 

lu)  = 

is  a  rate  function  on  £.  Furthermore,  suppose  that  for  all  N  <  00  and  families 
{u£}  C  djv[y  such  that  ue  converges  in  distribution  to  some  u  £  An\}-2],  we 
have  that  Q£  +  JQue(s)ds )  — >  Q°  (fQ  u(s)ds)  in  distribution  as  e  — >  0. 

Then  the  family  {Xe ,  e  >  0}  satisfies  the  LDP  on  £ ,  as  e  — >  0,  with  rate  function 

I. 

Proof  of  Theorem  3.1.  We  will  only  show  that  the  sequence  ((f>e,Fe)  satisfies 
a  LDP  in  Wk-i  x  Wfc_i  with  rate  function  I  defined  as  in  (3.7).  The  LDP  in 
Wfc_i  x  Wk-i  follows  similarly.  Let  Gs  :  S  —>  Wk-i  x  Wfc_i  be  a  measurable 
map  such  that  Gs{\/£l3)  =  (</>£,  Fe)  a.s.,  where  F£  is  given  by  (3.1)  and  (f£  is  the 
associated  flow  based  on  F£.  Define  G°  ■  S  — >  Wk-i  x  Wk- 1  by  G°(f0  u(s)ds)  = 
((^A0)  if  u  £  L2([0,  T\  :  I2)  and  with  <f°,F0  as  defined  in  (3.6)  and  (3.5), 
respectively.  We  set  G°{f)  =  0  for  all  other  f  £  S. 

Fix  N  <  00  and  consider  T N  =  {^°(/o  u(s)ds),v,  £  Sjvfc]}-  We  first 
show  that  r at  is  a  compact  subset  of  Wk- 1  x  Wk- 1-  For  that  it  suffices  to 
show  that  if  un,u  £  Sn [^2]  are  such  that  un  — >  u,  then  G°(f0  un(s)ds)  — > 
G°(f0  u(s)ds)  in  Wk- 1  x  Wk- 1-  This  is  immediate  from  Theorem  3.4  on  noting 
that  G°(f0  un(s)ds)  =  ((fn,Fn),  where  (fn,Fn  are  as  in  (3.9)  and  (3.8)  respec¬ 
tively  with  en  =  0;  and  G°(f0  u(s)ds)  =  F°’u),  where  cf>0’u,F0'u  are  as  in 

(3.6)  and  (3.5)  respectively. 
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Next  let  {un}  C  Aw[h\  and  en  £  (0,oo)  be  such  that  en  — >  0  and  un  con¬ 
verges  in  distribution  to  some  u  as  n  — >  oo.  In  order  to  complete  the  proof, 
it  is  enough,  in  view  of  Theorem  3.5  and  the  definition  (3.7),  to  show  that 
+  f0  un(s)ds)  ->  G°(f0  u(s)ds)  in  Wk-i  x  Wk-i,  as  n  ->  oo.  An  ap¬ 
plication  of  Girsanov’s  theorem  shows  that  Gen{^/FfP  +  f0  un(s)ds )  =  ( (f>n,Fn ), 
where  <f>n,  Fn  are  defined  as  in  (3.9)  and  (3.8)  respectively.  Also  G°(f0  u(s)ds)  = 
((f>0,u,  F0,u),  where  (f>°’u,F°’u  are  the  same  as  in  (3.6),  (3.5)  respectively.  The 
result  now  follows  from  Theorem  3.4.  □ 


4  Proof  of  Theorem  3.4 

This  section  will  present  the  proof  of  Theorem  3.4.  It  is  worth  recalling  assump¬ 
tions  that  will  be  in  effect  for  this  section,  which  are  that  {«„}  is  converging  to 
u  in  distribution  as  an  Sjvlh]— ' valued  sequence  of  random  variables,  and  that 
(a,  b)  £  Cy'5(Rdxd)  x  Cy,l5(]Rd),  for  some  k  £  N  and  5  £  (0, 1]. 

We  begin  by  introducing  the  ( m  +  p)-point  motion  of  the  flow  and  the 
related  notion  of  “convergence  as  diffusions”.  Let  x  =  (x±,  x2,  ■  ■  • ,  %m)  and 
y  =  (2/1 1 2/2,  ■  •  •  ,2 ip)  be  arbitrary  fixed  points  in  Rdxm  and  Kdxp,  respectively. 
Set 

#(*)  =  ■  ■  ■  A?{Xm)) 

and 

Fn(y,  t )  =  (. Fn(yi,t ),  Fn(y2 ,  *),...,  Fn(yp,  t)) . 

Then  the  pair  {^"(x), Fn(y, 1)}  is  a  continuous  stochastic  process  with  values 
in  ffidxm  x  M.dxp  and  is  called  an  (m  +  p)-point  motion  of  the  flow.  Let  Vm  = 
C([0,T]  :  Kdxm)  be  the  Frechet  space  of  all  continuous  maps  from  [0,T]  to 
]R“xm,  with  the  usual  semi-norms,  and  let  Vm,p  =  Vm  x  Vp  be  the  product 
space. 

Definition  4.1  Let  P"x  y^  and  P°x  y^  be  the  measures  induced  by  (</>n(x),  Fn(y)) 
and  (0°’“(x),  F°’u(y)),  respectively,  on  VmtP.  Thus  for  A  £  B(VmtP) 

P?x,y)  =P((^(x),F”(y))  e  A),  P(x,y)  =P((/’“(x),F°’“(y))  e  A). 

The  sequence  {((/>n,F")}r  >1  is  said  to  converge  weakly  as  diffusions  to  (</>°’“,  F0,u) 
as  n  — >  00  if  P”x  y)  converges  weakly  to  P°xy.j  as  n  — >  00  for  each  (x,  y)  £ 

Rdxm  x  andm,p=  1,2,.... 

The  following  well  known  result  (c.f.,  [15,  Theorem  5.1.1])  is  a  key  ingredient  to 
the  proof  of  Theorem  3.4. 

Theorem  4.2  The  family  of  probability  measures  PJ?. ^{respectively,  P^—1)  con¬ 
verges  weakly  to  probability  measures  ^-^respectively,  PjJ^),  as  n  — >  00  if  and 
only  if  the  following  two  conditions  are  satisfied: 
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1.  the  sequence  {(</>", F")}  >x  converges  weakly  as  diffusions  to  ((f>0’u,F0’u) 
as  n  — >  oo, 

2.  the  sequence  IPfc _  i  !•  ( respectively ,  {lPfc_1 } )  is  tight. 

We  will  show  first  that  under  the  condition  of  Theorem  3.4  the  sequence  Fn)}n>1 
converges  weakly  as  diffusions  to  (</>°’“,  F0,u)  as  n  — >  oo.  We  begin  with  the  fol¬ 
lowing  lemma. 


Lemma  4.3  For  each  x  £ 


E  sup 

0  <t<T 


OO  r-t 

V  /  fk{x,s)d(3k{s) 

k=lJ° 


2 

<  oo, 


sup  E  sup 
n  0  <t<T 


E  /  h{<K{x),s)dpk{s) 


k= 1  * 


2 

<  OO. 


(4.1) 

(4.2) 


Proof.  We  will  only  prove  (4.2).  The  proof  of  (4.1)  follows  in  a  similar  man¬ 
ner.  From  the  Biirkholder-Davis-Gundy  inequality  the  left  hand  side  of  (4.2) 
is  bounded  by 


ciE  V  /  Tr (/,//)  (0r(*),r)dr  =  ClE 

i=i  Jo 

<  c2 1 |a| l'r,fe,<5- 

The  last  expression  is  finite  since  a  belongs  to  Cy'5(Kdxd). 


Tr  (a(<j)r(x),  cf>r(x),r)  )dr 


□ 


An  immediate  consequence  of  Lemma  4.3  is  the  following  corollary  (c.f.  (3.8), 
(3.9)). 


Corollary  4.4  For  each  x  £  Rd  andt  £  [0,T], 

Fn(x,t)=  f  bUn(x,r)dr  +  Sn{x,t) 

Jo 


and  t 

4%(x)=x+  I  bUn((t>r{x),r)dr +  Tn(x,t), 

Jo 

where  S„fx,-)  and  Tn{ x,-)  are  continuous  stochastic  processes  with  values  in 
Rd,  satisfying  sup0<t<T{|S'n(a;,t)|  +  \Tn(x,t)\}  — >  0  in  probability  as  n  — >  oo. 

The  following  lemma,  showing  the  tightness  of  P"x  ,  plays  an  important  role 
in  the  proof  of  the  weak  convergence  as  diffusions. 

Lemma  4.5  For  each  x  £  Rd  the  sequence  {((/>"  (x),F™(x))}  „>i  is  tight  in 
C([0,  T]  :  Rd  x  Rd). 
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Proof.  We  will  only  argue  the  tightness  of  {</>n(a;)}.  Tightness  of  {Fra(a;)}  is 
proved  similarly.  Corollary  4.4  yields  that  Tn(x,  •)  is  tight  in  C([0,T]  :  Rd). 
Thus  it  suffices  to  show  the  tightness  of  { fQ  bUn  (cf™ (x) ,  r) dr} .  Fix  p  >  0. 
From  the  Cauchy-Schwarz  inequality,  (3.4),  and  recalling  that  un  £  Ajsrlh], 
E|/s*  r)dr\P  can  be  bounded  by 


E 


|  bUri  (</>”(*),  r)[  dr 


p/2 


(t  -  s)p/2  <  Cl{||a||?iM  +  |H||,MF/2(t  -  s)^2 
<  c2(t-s)p/2. 


The  result  follows.  □ 

Proposition  4.6  Let  un  — >  u  in  distribution  as  Sjsi[l2\-valued  random  vari¬ 
ables.  Then  the  sequence  {{4>n,  Fn)}n>1  converges  weakly  as  diffusions  to  F0,u) 
as  n  — >  oo . 


Proof.  In  view  of  the  tightness  established  in  Lemma  4.5  and  Corollary  4.4, 
it  suffices  to  show  that  for  each  t  £  [0,T],  the  map  (£,v)  e- >  fQ  bv(t;s,s)ds, 
from  C([0,T]  :  Md)  x  Sjv^]  to  Kd,  is  continuous.  Let  (£n>vn)  —>  (£,v)  in 
C([0, T\  :  Rd)  x  SN[l2].  Then, 


(t>v„(£is)  ~  bv{t,s,s))ds 


< 


[  (bv„(C>s)  ~  bv„(ts,s))ds 
Jo 

/  (bvn(£s>&)  bv(£s,s))ds 
Jo 


(4.3) 


=  L\  +  1/2  • 


For  each  x  G  Rd  we  have  that 

rt 


'0 


( bVn(x ,  5)  —  bv(x ,  s))ds 


00 

/  fi(x’s)(vi'(s)  -  Vi(a))ds 
1= 1  J° 


0,  (4.4) 


since  vn  — >  v  weakly  in  L2([0,T]  :  If)  and 


\fi{x,s)\2ds  <  T||a||riM 


<  00. 


Furthermore  from  (3.4)  (recall  k  >  1)  we  have  that  for  some  ci  £  (0,oo)  and  all 
x,  y  £  Kd,  0  <  t  <  T, 


i>Vrl(y,s))ds 


<  \x-y\  [  (| |6un (s) | |fe,7 

Jo 

<  c±\x  —  y\. 


Il&(s)llfc,7)ds 

(4.5) 


Using  the  Ascoli-Arzela  Theorem  (in  the  spatial  variable)  and  equations  (4.4), 
(4.5)  yield  now  that  the  expression  on  the  left  side  of  (4.4)  converges  to  0 
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uniformly  for  x  in  compact  subsets  of  Rd.  Thus  L2  — >  0  as  n  — >  00.  Following 
similar  arguments  L\  is  bounded  by  C2  sup0<s<T  |£"  —  | ,  which  converges  to  0 

as  n  — >  00.  Hence  (4.3)  converges  to  0  as  n  — >  00  and  the  result  follows.  □ 

We  next  show  the  tightness  of  the  family  of  probability  measures  {PJ?^}. 
Key  ingredients  in  the  proof  are  the  following  uniform  Lp-estimates  on  daFn{x ,  t ) 
and  da(j^{x). 

Lemma  4.7  For  each  p  >  1  there  exists  k\  £  (0, 00)  such  that  for  all  1. 1'  £ 
[0,T],  x  £  n>  1,  and  |a|  <  k: 

E\daFn(x,t)  -daFn(x,t')\p  <  h  \t-t'\p/2  .  (4.6) 

Proof.  Fix  a  multi-index  a  such  that  |a|  <  k  and  p  >  1.  Using  the  Biirkholder- 
Davis-Gundy  inequality  for  the  martingale  daM(x,-)  and  the  fact  that  a  £ 
dfS(M.d),  we  obtain  that  for  some  c\  £  (0,oo)  and  all  x  £  Rd,  t,  t'  £  [0,  T], 

E| daM{x,t)  -  daM(x,t')\p  <  ci  \t  -  t'\p/2  .  (4.7) 

Recalling  that  bUn(-,t)  £  Cfc,T(Rd)  a.e.  (t,w)  and  using  (3.4)  we  get 

/  sup  \dabUri(x,r)\dr  <  00  a.e., 

Jo  ®eRd 

and  thus  da  bUn  (x,  r)dr  =  f*  dabUri  (x,  r)dr  a.e.  An  application  of  the  Cauchy- 
Schwarz  inequality  and  (3.4)  now  gives,  for  some  C2  £  (0,oo), 

p 

<c2|  t-t'\p/2.  (4.8) 

Equation  (4.6)  is  an  immediate  consequence  of  (4.7)  and  (4.8).  □ 

For  g  :  Rd  x  [0,T]  — >  Rd,  let  \7yg(y,r)  be  the  d  x  d  matrix  with  entries 
\yy9ilh  r)\ij  =  Jjp9i(y,r).  Differentiating  with  respect  to  X\  in  (3.9)  we  obtain 

diffi(x)  =  dxx+  [  \yybUn(<f>r{x),r)  ■  d14>rf(x)]dr 
Jo 

+  f  ^yM{(!>rf{x),dr)-di(jf!f{x) 

Jo 

=  d\X+  f  ^ vFn((j)rf(x)1dr)  ■  di(j)rf(x). 

Jo 

By  repeated  differentiation  one  obtains  the  following  lemma  whose  proof  follows 
along  the  lines  of  Theorem  3.3.3  of  [15].  Given  0  <  m  <  k,  let  Am  be  the 
set  of  all  multi-indices  a  satisfying  |a|  <  to.  For  a  multi-index  7,  denote 
by  771(7)  =  tt{7o  :  |7o|  <  111}-  Also  for  a  |7|-times  differentiable  function  T  : 


E 


da  /  bUn(x,r)dr 
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Rd  — >  R,  denote  by  9-l7l'E'(a:)  the  m(7)-dimensional  vector  with  entries  d70']/^), 
Ibol  <  |t|-  If  ^  =  ('hi,  tf2)  •  ■  ■ ,  :  Rd  — >  Rd  is  such  that  each  47  is  |7|-times 

continuously  differentiable  then  d-l7^^)  =  (<9^l7l'F1(x), . . . ,  d-l'i'I'F^x)).  We 
will  call  a  map  P  :  Rm  — >  Rd  a  polynomial  of  degree  at  most  p  if  P(x )  = 
(Pi(x), . . . ,  Pd(x))'  and  each  Pt  :  Rm  — »  R  is  a  polynomial  of  degree  at  most  p. 
Also  for  u,  v  £  R;  we  define  u*  v  =  (u\Vi, . . . ,  uivi)' . 

Lemma  4.8  Let  a,  (3,^  be  multi-indices  such  that  |a|,  |/3|,  I7I  <  k.  Then  there 
exist  subsets  A^,  Kf  of  A|a|  and  K.\a\_x  respectively ,  a  subset  o/A|7|  and 
polynomials  Pjf^  :  Rm(7)  — ►  Rd  of  degree  at  most  |a|  ,  such  that  da(j)n  satisfies: 

daW(x)  =  dax+  [  G”(a“^(x),C(x), 

Jo 

+  E  t  G;:;{d^W(x),W(x),dr),  (4.9) 

(/3,7)€Ai  xA^  Jo 

where  for  x,y  £  Rd,  Gn{x1y,r)  =  VvFn(y,r)  ■  x  and  for  (x,y)  £  Rm^7^  x 
Rd,  G^(x,y,r)  =  P^(x)  *  d$Fn(y,r). 

Note  in  particular  that  in  the  third  term  on  the  right  hand  side  of  (4.9),  one 
finds  partial  derivatives  of  <^"(x)  of  order  strictly  less  than  |a|. 

Lemma  4.9  For  each  p>  1,  L  £  (0,oo),  there  is  a  constant  k\  =  ki(k,p,L)  £ 
(0,oo)  such  that  for  every  multi-index  a ,  |a|  <  k 

sup  sup  E  sup  |dQ</>”(x)| p  <  jfei 

n  \x\<L  0  <t<T 

sup  sup  E|<9“</>™(x)  —  da(jjf,{x)\p  <  k\  \t  —  t! \p^2 

n  \X\<L 

Proof.  Fix  L  >  0  and  consider  x  £  Rd  such  that  \x\  <  L.  We  will  first  show 
inequality  (4.10).  It  suffices  to  prove  (4.10)  for  a  =  0  and  establish  that  if,  for 
some  m  <  k,  it  holds  for  da(j)f  with  |a|  <  m  and  all  p  >  1  then  it  also  holds 
for  d ida(j)f  with  all  p  >  1  (with  a  possibly  larger  constant  fci)  and  i  =  1, . . . ,  d. 
The  desired  result  then  follows  by  induction. 

Consider  first  a  =  0.  For  this  case  the  bound  in  (4.10)  follows  immedi¬ 
ately  on  using  (3.4)  and  applying  the  Biirkholder-Davis-Gundy  inequality  to 
the  square  integrable  martingale  Nt  =  f0M((/>™(x),dr)  [note  that  {(N))t  = 

f*  a(</>"(x),  cf>r(x),  r)dr  and  a  £  C^,'5(Rdxd)]. 

Now,  suppose  that  (4.10)  holds  for  all  multi-indices  a  with  |a|  <  to,  for  some 
to  <  k.  Fix  a  with  |a|  <  to,  an  *  £  {1,2, . . .  ,  d},  and  consider  the  multi-index 
a  =  a  +  lj,  where  1  ,  is  a  d-dimensional  vector  with  1  in  the  ith  entry  and  0 
elsewhere.  From  Lemma  4.8,  one  finds  that  dacj)f  solves  (4.9)  for  a  =  a.  Note 
that  for  (3  £  A~, 


(4.10) 

(4.11) 
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From  (3.4)  and  recalling  that  (6,  a)  £  C^S(M.d)  x 
some  ci,C2  £  (0,oo), 


sup  sup 

o <t<T  yeRd 


dybu(y,  s)ds 


<  ci  and  sup  sup 
o<t<Tymd 


),  we  have  that  for 

«))). 


<  C2- 


This  along  with  the  assumption 


sup  sup  E  sup  \du (ft (x)\p  <  k\  for  u,  \v\  <  |a| 

n  |x|<i  0  <t<T 

shows  that  for  some  C3  £  (0,oo),  for  all  ((3, 7)  £  Al  x  A| 


sup  sup  E  sup 

n  |x|<L  0  <t<T 


<  c3. 


Also,  in  a  similar  manner  one  has  for  some  C4  £  (0,  00) 


E  sup 

0  <t<T 


Gn{da4>r{x),4,r{x)idr)  <  C4  f  E  (  sup  \da<j>^{x)\ ^  ds. 

JO  \0<r<s  / 


Combining  the  above  inequalities  we  obtain 
sup  sup  E  sup  \dacj)g(x)\p  <  C3  +  C4SUP  sup  [  E  (  sup  \dac/)™(x)\p  )  ds. 

n  \x\ <L  0 <s<t  n  \x\<L  J 0  \0<r<s 


Now  an  application  of  Gronwall’s  lemma  shows  that  for  some  C5  G  (0,  00) 
sup  sup  E  sup  \d&^{x)\p  <  C5. 

n  \x\  <L  0  <t<T 

This  establishes  (4.10)  for  all  a  with  |d|  <  |a|  +  1.  Finally  consider  (4.11).  For 
t,  t!  £  [0,T],  t!  <  t,  we  have  from  (4.9)  that 

daW{x)  -daW(x)  =  J*  Gn(daW(x),ff(x),dr) 

+  E  ;),^(®),dr).  (4.12) 

(<3,7)6Ai  xA2 

Using  (4.10)  on  the  right  hand  side  of  (4.12)  we  now  have  (4.11)  via  an  appli¬ 
cation  of  Holder’s  and  Biirkholder-Davis-Gundy’s  inequalities.  □ 


The  proof  of  Theorem  3.4  proceeds  along  the  lines  of  Section  5.4  of  [15].  We 
begin  by  introducing  certain  Sobolev  spaces.  Let  j  be  a  non-negative  integer 
and  let  1  <  p  <  oo.  Let  B jy  =  B( 0,  N)  be  the  Kd-ball  with  center  the  origin  and 
radius  N.  Let  h  :  M.d  — >  be  a  function  such  that  the  distributional  derivative 
d°h  £  Lp(Bn)  for  all  a  such  that  |a|  <  j.  Define 


1 1^1 1  j,p:N  ~ 


1/p 
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The  space  Hl°f  =  {h  :  Kd  — >  Rd,  ||/i| |j,p:JV  <  oo  for  all  N}  together  with  the 
seminorms  defined  above  is  a  real  separable  semi-reflexive  Frechet  space.  By 
Sobolev’s  imbedding  theorem,  we  have  Hl°+ lp  C  CJ(Md)  C  Hl°p  if  p  >  d. 
Furthermore  the  imbedding  i  :  — »  CJ  (Kd)  is  a  compact  operator  by  the 

Rellich-Kondrachov  theorem  (see  [1]). 

Proposition  4.10  The  sequence  {(<(>”,  Fn)}„> i  is  tight  in  Wk-\  x  H4_i 

Proof.  It  suffices  to  show  that  both  {(j)n}n>i  and  {F’n}n>i  are  tight  in  Wk- 1- 
We  will  use  Kolmogorov’s  tightness  criterion  [15,  Theorem  1.4.7,  p.  38].  From 
Lemmas  4.7  and  4.9,  we  have  that  for  each  p  >  1,  N  >  1,  there  exist  Ci,C2  £ 
(0,oo)  such  that  for  all  t,t'  £  [0,T] 

supE||0"  -  <ff, \\pktP:N  <  ci  | t-t'\p/2, 

n 

supE||F"(-,t)  -  Fn(;t,)\\pkp:N  <  c2  \t-t'\p/2  . 

n 

Furthermore,  since  Fn(-,  0)  =  0  and  4>q{x)  =  x,  we  get  that  for  each  p  >  1,  N  > 
1  there  exist  C3,  C4  £  (0,  00)  such  that 

supE||$*||£  ^  <  c3  and  supE||Fm(f)||j’i p.N  <  c4. 

n  n 

Theorem  1.4.7  of  [15]  now  gives  tightness  in  the  semiweak  topology  on  Hl£p 
(cf.  [15]).  Since  the  imbedding  map  i  :  Hjfp  — >  Cfe_1  is  compact,  tightness  in 
Wk-i  x  Wk- 1  with  the  topology  introduced  in  Section  2  follows  (see  [15,  pp. 
246-247]).  □ 

Recall  the  definitions  (3.2)  and  (3.3).  For  the  proof  of  the  following  lemma  we 
refer  the  reader  to  Section  2.1  of  [4]. 

Lemma  4.11  Let  /„,  /  e  Wk-i  be  such  that  sup0<t<TAfc_i (fn(t),f(t))  ->  0, 
as  n  — >  00.  Then  sup0<t<T  c4_i(/n(t), /(f))  ->  0. 

Proof  of  Theorem  3-4-  Convergence  as  Cfe_1  flows  is  immediate  from  Theo¬ 
rem  4.2,  Proposition  4.6  and  Proposition  4.10.  Using  Skorohod’s  representa¬ 
tion  theorem,  one  can  find  a  sequence  of  pairs  {(</>",  Fn)}„>  1  which  has  the 
same  distribution  as  {(4>n,  Fn)}n>i  and  {((/°,.F0)}  which  has  the  same  distri¬ 
bution  as  {(</>°,  F0)}  and  sup0<t<T  [Afc($),  <$)  +  A k(Fn (t) ,  F° (t))\  ->  0,  a.s. 
Since  <fn,<f>0  £  Wk  a.s.,  the  same  holds  for  />", <fi°.  Thus  from  Lemma  4.11 
sup0<t<T  dk-i(4>f,  <ft)  0  a.s.  Hence  ((f>n,Fn)  — >  (cj)0,F0)  as  Gfe_1-flows.  □ 

5  Application  to  image  analysis 

A  common  approach  to  image  matching  problems  (see  [14],  [17],  [12]  and  ref¬ 
erences  therein)  is  to  consider  a  Rp-valued,  continuous  and  bounded  function 
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T(-),  referred  to  as  the  “template”  function,  defined  on  a  bounded  open  set 
OCR3,  which  represents  some  canonical  example  of  a  structure  of  interest.  By 
considering  all  possible  smooth  transformations  h  :  O  — >  O  one  can  generate  a 
rich  library  of  targets  (or  images)  given  by  the  form  T(/i(-)). 

In  typical  situations  we  are  given  data  generated  by  an  a  priori  unknown 
function  h,  and  the  key  question  of  image  matching  is  that  of  estimating  h  from 
the  observed  data.  A  Bayesian  approach  to  this  problem  requires  a  prior  distri¬ 
bution  on  the  space  of  transformations  and  a  formulation  of  a  noise/data  model. 
The  “maximum”  of  the  posterior  distribution  on  the  space  of  transformations 
given  the  data  can  then  be  used  as  an  estimate  h  for  the  underlying  unknown 
transformation  h.  In  certain  applications  (e.g.,  medical  diagnosis),  the  goal  is  to 
obtain  numerical  approximations  for  certain  key  structures  present  in  the  image, 
such  as  volumes  of  subregions,  curvatures  and  surface  areas.  If  the  prior  distri¬ 
bution  on  the  transformations  (and  in  particular  the  estimated  transformation) 
is  on  the  space  of  diffeomorphisms,  then  this  information  can  be  recovered  from 
the  template.  Motivated  by  such  a  Bayesian  approach  a  variational  problem  on 
the  space  of  diffeomorphic  flows  was  formulated  and  analyzed  in  [12]. 

Before  going  in  to  the  description  of  this  variational  problem,  we  note  that 
although  the  chief  motivation  for  the  variation  problem  studied  in  [12]  came  from 
Bayesian  considerations,  no  rigorous  results  on  relationships  between  the  two 
formulations  (variational  and  Bayesian)  were  established.  The  goal  of  our  study 
is  to  develop  a  rigorous  asymptotic  theory  that  connects  a  Bayesian  formulation 
for  such  an  image  matching  problem  with  the  variational  approach  taken  in  [12]. 
The  precise  result  that  we  will  establish  is  Theorem  5.1,  given  at  the  end  of  this 
section. 

Let  Cfi°(0)  be  the  space  of  infinitely  differentiable,  real-valued  functions  on 
O  with  compact  support  in  O.  The  starting  point  of  the  variational  formulation 
is  a  differential  operator  L  on  [Cq°(C>)]3,  the  exact  form  of  which  is  determined 
from  specific  features  of  the  problem  under  study.  The  formulation,  particularly 
for  problems  from  biology,  often  uses  principles  from  physics  and  continuum 
mechanics  as  a  guide  in  the  selection  of  L.  We  refer  the  reader  to  Christensen 
et.  al.  [9],  [10],  where  natural  choices  of  L  in  shape  models  from  anatomy  are 
provided. 

Define  the  norm  ||  ■  \\l  on  [Cq°(C>)]3  by 


3 

E 

i= 1  ' 


\(Lf)i(u)\2du, 


o 


where  we  write  a  function  g  e  [Cq°(C?)]3  as  (<71,(72,33)  •  It  is  assumed  that  ||  ■  \\l 
generates  an  inner  product  on  [Cg°(0)]3  and  that  the  Hilbert  space  H  defined 
as  the  closure  of  [C“(C>)]3  with  this  inner  product  is  separable.  We  will  need 
the  functions  in  H  to  have  sufficient  regularity  and  thus  assume  that  the  norm 
||  •  || l  dominates  an  appropriate  Sobolev  norm.  More  precisely,  let  W™+2'2(0) 
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be  the  closure  of  C“(0)  with  respect  to  the  norm 

\\9\\w^{0)=(f  E  \dag(u)\2du)1/2,  gGCZr(O),  (5.1) 

J°  \a\<m+2 

where  a  denotes  a  multi-index  and  m  >  3.  Define  Vm  =  \W™+  where 

(8)  is  used  to  denote  the  usual  tensor  product  of  Hilbert  spaces.  We  denote  by 
||  •  |  ym  the  norm  on  Vm.  The  main  regularity  condition  on  L  is  the  following 
domination  requirement  on  the  ||  •  |k  norm.  There  exists  a  constant  c  £  (0,cx)) 
such  that 

I l/lk  —  cll/llvm  for  all  /  £  [C^°(0)]3. 

This  condition  ensures  that  H  C  Cm,1/2(0)  (see  [1,  Theorem  4.12  parts  II  and 
III,  p.  85]).  We  denote  by  H  the  Hilbert  space  L2([ 0, 1]  :  H).  For  a  fixed  b  £  7~i 
let  {??s,t(a;)}s<t<i  be  the  unique  solution  of  the  ordinary  differential  equation 

dllSg^  =  b(jiStt(x),t),  r]s,s(x)  =  x,  0  <  s  <  t  <  1.  (5.2) 

Then  it  follows  that  {rjs,t,  0  <  s  <  t  <  T}  is  a,  forward  flow  of  (Cm-diffeomorphisms 
on  O  (see  [15,  Theorem  4.6.5,  p.  173]).  Since  b{-,t)  has  a  compact  support  in 
O,  one  can  extend  r]s>t  to  all  of  R3  by  setting  gs,t{x)  =  x,  if  x  £  Oc.  Extended 
in  this  way  r]s^t  can  be  considered  as  an  element  of  Gm ,  as  defined  in  Section  3. 
Denoting  770,1  by  hb ,  we  can  now  generate  a  family  of  smooth  transformations 
(diffeomorphisms)  on  O  by  varying  b  £  Ti.  Specifically,  the  library  of  trans¬ 
formations  which  is  used  in  the  variational  formulation  of  the  image  matching 
problem  is  {hb\b  £  H}. 

We  now  describe  the  data  that  is  used  in  selecting  the  transformation  hb-  for 
which  the  image  T[hf(-))  best  matches  the  data.  Let  £  be  a  finite  index  set  and 
{Xi}ie£  be  a  collection  of  disjoint  subsets  of  O  such  that  =  O.  Collected 

data  {dj}ie£  represents  integrated  responses  over  each  of  the  subsets  Xi,  i  £  C. 
More  precisely,  if  T(/i(-))  was  the  true  underlying  image  and  the  data  were 
completely  error  free  and  noiseless,  then  di  =  fx  T(h(a))da/vo\(Xi),  i  £  £, 
where  vol  denotes  volume.  Let  d  =  ^2,  dn)' ,  where  n  =  \C\.  Defining 

Yd(x)  =  di,  x  £  Hi,  i  £  £,  the  expression 

\  I  \T(hb{x))  -  Yd(x)\2dx 

is  a  measure  of  discrepancy  between  a  candidate  target  image  T(hb(-))  and 
the  observations.  This  suggests  a  natural  variational  criterion  for  selecting  the 
“best”  transformation  matching  the  data.  The  objective  function  that  is  mini¬ 
mized  in  the  variational  formulation  of  the  image  matching  problem  is  a  sum  of 
two  terms,  the  first  reflecting  the  “likelihood”  of  the  transformation  or  change- 
of- variable  hb  and  the  second  measuring  the  conformity  of  the  transformed  tem¬ 
plate  with  the  observed  data.  More  precisely,  define  for  b  £  H 

Mb)  =  \  (| |6|  \2n  +  J  \T(hb(x))  -  Yd(x)\2dx) .  (5.3) 
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Then  b*  £  argmin b&nJd(b),  represents  the  “optimal”  velocity  field  that  matches 
the  data  d  and  for  which  the  hb «,  obtained  by  solving  (5.2),  gives  the  “optimal” 
transformation.  This  transformation  then  yields  an  estimate  of  the  target  image 
as  T(hb*(-))-  Equivalently,  defining  for  each  h  £  G° 

Jd{h)=  inf  Jd{b)  (where  =  {b  £  H  :  h  =  hb}), 

be^h 

we  see  that  an  optimal  transformation  is  h*  =  hb *  £  argmin^  Jd(h). 

Up  to  a  relabeling  of  the  time  variable,  the  above  variational  formulation  (in 
particular  the  cost  function  in  (5.3))  was  motivated  in  [12]  through  Bayesian  con¬ 
siderations,  but  no  rigorous  justification  was  provided.  [In  [12]  the  orientation 
of  time  is  consistent  with  the  change-of-variable  evolving  toward  the  identity 
mapping  at  the  terminal  time.  To  relate  the  variational  problem  to  stochastic 
flows  it  is  more  convenient  to  have  the  identity  mapping  at  time  zero.]  We  next 
introduce  a  stochastic  Bayesian  formulation  of  the  image  matching  problem  and 
describe  the  precise  asymptotic  result  that  we  will  establish. 

Let  {pi}  be  a  complete  orthonormal  system  in  H  and  /?  =  (Pi)^  be 
as  in  Section  2,  a  sequence  of  independent,  standard,  real-valued  Brownian 
motions  on  some  filtered  probability  space  (fi,  T,  P,  {Ft})-  Write  (C([0,T]  : 
K°°),  B  (C([0,  T)  :  K°°)))  =  (S',  < S),  and  note  that  /3  is  a  random  variable  with 
values  in  S.  Consider  the  stochastic  flow 

OO 

d^s,t(x)  =  ^ s,s(x )  =  £,  x  e  O,  0  <  s  <t  <1,  (5.4) 

2  —  1 

where  e  £  (0,oo)  is  fixed.  From  Maurin’s  theorem  (see  [1,  Theorem  6.61,  p. 
202])  it  follows  that  the  imbedding  map  H  — >  Vm-2  is  Hilbert-Schmidt.  Also, 
Vm_ 2  is  continuously  embedded  in  Cm_2,1/2(C)).  Thus  for  some  ki,k2  £  (0,oo) 
and  all  it,  x,y  £  O, 


J2  \Mu)\2  <  h  ll0i|lvm-2  <  °°> 

2—1  2—1 


J2\MX)  -  My)\2  <  ~  y\2^2\\Pi\\lm_2 

2=1  2=1 


k2\x-  y\2. 


One  also  has  that  if  pi  is  extended  to  all  of  M3  by  setting  pi(u)  =  0,  for  all 
x  £  Oc,  then  a(x,y)  =  is  in  C™~2,1^2(K3x3).  Thus  it  follows 

(cf.  [15,  pages  80  and  106])  that 


oo 

F(x,t)  =  /  Mx)dPi(r) 

i=i  Jo 

is  a  Cm_2,iy-Brownian  motion,  0  <  v  <  1/2,  with  local  characteristics  (a,  0). 
Also  (5.4)  admits  a  unique  solution  {t/f  t(x),  0  <  s  <  t  <  1}  for  each  x  £  O  and 
{V’L}  o<«<t<i  is  a  forward  flow  of  Cfe-diffeomorphisms,  with  k  =  m  —  2,  (see 
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[15,  Theorem  4.6.5]).  In  particular,  Xs  =  tpQ  1  is  a  random  variable  in  the  space 
of  Cfc-diffeomorpshisms  on  O.  The  law  of  Xe  (for  a  fixed  e  >  0)  on  Gk  will 
be  used  as  the  prior  distribution  on  the  transformation  space  Gk .  Note  that 
T(Xe(-))  induces  a  measure  on  the  space  of  target  images. 

We  next  consider  the  data  model.  Let  C  and  n  be  as  introduced  below  (5.2). 
We  suppose  that  the  data  is  given  through  an  additive  Gaussian  noise  model: 

°i  =  J  T(XE(x))dx  +  ^i 

where  i  S  £}  is  a  family  of  independent,  p-dimensional  standard  normal 
random  variables. 

In  the  Bayesian  approach  to  the  image  matching  problem  one  considers  the 
posterior  distribution  of  Xe  given  the  data  D  and  uses  the  “mode”  of  this  dis¬ 
tribution  as  an  estimate  for  the  underlying  true  transformation.  More  precisely, 
let  {Te}e>0  be  a  family  of  measurable  maps  from  to  V(Gk )  (the  space  of 
probability  measures  on  Gfc),  such  that 

TE(A\D)  =  V[XE  £  A\D]  a.s.  for  all  A  £  B(Gk). 

We  refer  to  re(-|d)  as  a  regular  conditional  probability  distribution  (r.c.p.d.)  of 
Xs  given  D  =  d.  In  Theorem  5.1  below,  we  will  show  that  there  is  a  r.c.p.d. 
{TE(-\d),  d  £  R"P}e>0  such  that  for  each  d  £  Rnp,  the  family  {re(-|d)}£>o, 
regarded  as  elements  of  V(Gk~1)  D  V(Gk),  satisfies  a  LDP  with  rate  function 

Id{h)  =  Jd(h)  -  Ad,  where  \d  =  inf  Jd(h)  =  inf  Jd(b). 

/jGG''-1  bGH 

Jd(h) 

Formally  writing  rE(A|d)  ss  J4e~  *  dh,  one  sees  that  the  for  small  e,  the 
“mode”  of  the  posterior  distribution  given  D  =  d,  which  represents  the  “optimal 
transformation”  in  the  Bayesian  formulation,  can  be  formally  interpreted  as 
argminft/d(/i).  Note  that  Jd{h)  =  oo  if  h  (j  Gm  (recall  m  =  k  +  2).  Theorem 
5.1  in  particular  says  that  h  £  Gm  is  a  <5— minimizer  for  Id(h)  if  and  only  if  it  is 
also  a  (i-minimizer  for  Jd{h).  Thus  Theorem  5.1  makes  precise  the  asymptotic 
relationship  between  the  variational  and  the  Bayesian  formulation  of  the  above 
image  matching  problem. 

Theorem  5.1  There  exists  an  r.c.p.d.  TE  such  that  for  each  d  £  M",  the  family 
of  probability  measures  {re(d)}£>0  on  Gk~1  satisfies  a  large  deviation  principle 
(as  e  — >  0)  with  rate  function 

Id(h )  =  Jd(h)  -  Ad-  (5.5) 

We  begin  with  the  following  proposition.  Let  I  :  Gk~ 1  — >  [0,oo]  be  defined 
as 

^0)  =  ,inTf  \Mh- 

bG't'h  Z 
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Proposition  5.2  The  family  {Xe}£>o  satisfies  a  LDP  in  Gk  1  with  rate  func¬ 
tion  I . 

Proposition  5.2  is  consistent  with  results  in  Section  3  in  that  although  the 
local  characteristics  are  in  Ck  and  Xe  £  Gk,  the  LDP  is  established  in  the 
larger  space  Gk~1 .  This  is  due  to  the  tightness  issues  described  in  the  Intro¬ 
duction.  Furthermore,  as  noted  below  (5.2),  if  ||6||^  <  oo  then  b  induces  a  flow 
of  Cm-diffeomorphisms  on  O.  Thus  if  h  £  Gk~1  \  Gm  then  is  empty,  and 
consequently  1(h)  =  oo.  Hence  there  is  a  further  widening  of  the  “gap”  be¬ 
tween  the  regularity  needed  for  the  rate  function  to  be  finite  and  the  regularity 
associated  with  the  space  in  which  the  LDP  is  set.  This  is  due  to  the  fact  that 
the  variational  problem  is  formulated  essentially  in  terms  of  L2  norms  of  deriva¬ 
tives,  while  in  the  theory  of  stochastic  flows  as  developed  in  [15]  assumptions 
are  phrased  in  terms  of  L°°  norms. 


Proof  of  Proposition  5.2.  From  Theorem  3.1  and  an  application  of  the  contrac¬ 
tion  principle  we  have  that  {£Ye}£>0  satisfies  LDP  in  Gk~ 1  with  rate  function 

I*(h)=  inf  Jr  I  || u(s)\\fds, 

uec*(h )  z  j o 

where  C*(h)  =  {u  £  L2([0,1]  :  h)\h  =  </>0,“(l)}  and  where  fi°’u  is  defined  via 
(3.6),  but  with  /,;  there  replaced  by  f>i.  Note  that  there  is  a  one  to  one  correspon¬ 
dence  between  u  £  L2([0,1]  :  I2)  and  b  £  H  given  as  b(t.,x)  =  ui(t)(f>i(x) 

and  /0T  ||u(s)||22ds  =  \\b\fa.  In  particular  u  £  C*(h)  if  and  only  if  b  £  Thus 
I*(h)  =  1(h)  and  the  result  follows.  □ 

Proposition  5.3  For  each  d  £  K”,  Id  defined  in  (5.5)  is  a  rate  function  on 
Gk-\ 


Proof.  From  (5.5)  and  the  definition  of  I  we  have  for  h  £  Gk  1  that 
Id(h)  =  1(h)  +  l  [  | T(h(x))  -  Yd(x)\2dx 

z  JO 

-  hecf-i  +  \  ja  ~  *<*)!’*}  • 

From  Proposition  5.2,  I  is  a  rate  function  and  therefore  has  compact  level  sets. 
Additionally  T  is  a  continuous  and  bounded  function  on  O.  The  result  follows. 

□ 

Proof  of  Theorem  5.1.  We  begin  by  noting  that  r£(-|d)  defined  as 


re(A|d) 


j-  e-i  E?=i| di~fXi  T(h(v))dy\ 


r  ~f;T.i=i\di-fx.T(h(y))dy 

Qk-l  C 


pE(dh) 

72 

I  pe(dh) 
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where  fie  =  P  o  (Xe)_1  £  V(Gk~ 1),  is  a  r.c.p.d.  of  Xe  given  D  =  d.  Using 
the  equivalence  between  Laplace  principle  and  large  deviations  principle  (see 
[11,  Section  1.2])  it  suffices  to  show  that  for  all  continuous  and  bounded  real 
functions  F  on  Gfc_1, 


-elog 


)Gk~ 1 


exp 


Te{dv\d ) 


(5.6) 


converges  to  miheGk-i{F(h)  +  Id(h)}.  Note  that  (5.6)  can  be  expressed  as 

k-J*.  T(h(y))dy\ 


-  elog 
+  elog 


'  Gfe_1 


r  T(h(v))dv\  ] 

J  Gk_1 


HE(dh) 

He(dh). 


From  Proposition  5.2  we  see  that  the  first  term  converges  to 
JgL  jj(^)  +  F(h)  +1-J2dl~j^  T(h(y))dy  | 

=  +  \\\bWu  +  \  [  \T(Hy))  ~Yd{y)\2dy 

heG1*-1  feeth  (  2  2  Jo 


(5.7) 


which  is  Jd(h).  Likewise,  the  second  term  of  (5.7)  converges  to  —  \d.  This  proves 
the  result.  □ 


Notational  conventions. 

•  Transpose  of  a  d-dimensional  vector  v  will  be  denoted  by  v' . 

•  K  CC  Rd  means  that  K  is  a  compact  subset  of 

•  Given  a  k  and  m  dimensional  continuous  local  martingales  M,  N  on  some 
filtered  probability  space  (0,  F,  P,  {Ft}),  we  write  the  cross-quadratic 
variation  of  M  and  N  as  ((M,  N))t,  and  write  ((N))t  when  M  =  N. 
This  is  a  continuous  Kmxfe— valued  {Ft}— adapted  process. 

•  Borel  er  fields  on  a  Polish  space  £  will  be  written  as  B(£). 

•  Generic  constants  will  be  denoted  as  Ci,C2, . . ..  Their  values  may  change 
from  one  proof  to  the  next. 

•  The  infimum  over  an  empty  set  is  taken  to  be  oo. 
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